Validating Documentation with Domain Ontologies
نویسندگان
چکیده
Do we always use the same name for the same concept? Usually not. While misunderstandings are always troublesome, they pose particularly critical problems in software projects. Requirements engineering deals intensively with reducing the number and scope of misunderstandings between software engineers and customers. Software maintenance is another important task where proper understanding of the application domain is vital. In both cases it is necessary to gain (or regain) domain knowledge from existing documents that are usually inconsistent and imprecise. This paper proposes to reduce the risk of misunderstandings by unifying the terminology of the different stakeholders with the help of an ontology. The ontology is constructed by extracting terms and relations from existing documents. Applying text mining for ontology extraction has an unbeatable advantage compared to manual ontology extraction: Text mining detects terminology inconsistencies before they are absorbed in the ontology. In addition to this, the approach presented in this paper also introduces an explicit validation of ontology gained by text mining. 1. Documents are Always Inconsistent Usually, some kind of requirements document is written in the beginning of a software project. After requirements elicitation, one of the first tasks of the software developer is to understand the requirements document which includes trying to understand the terminology used. But practical experiences show that apart from being imprecise, requirements documents also use inconsistent terminology. A simple steam boiler specification [1], written for a formal methods contest, for example, looked extremely precise at first glance. However, the document called the same measuring unit in different places “water level measurement device”, “water level measuring unit”, “device to measure the quantity of water”, . . . . Obviously, this unwanted obfuscation hampers understanding of the domain. The reader can not be sure whether there is just one unit or two or three different devices. And of course, real life specifications, not written for an academic formal methods contest, are very likely even less consistent. Furthermore, real life documents are usually much longer rendering manual detection and resolution of such inconsistencies virtually impossible. 1Correspondence to: Leonid Kof, Fakultaet fuer Informatik, Technische Universitaet Muenchen, Boltzmannstr. 3, D-85748, Garching bei Muenchen, Germany Tel.: +49 89 289-17834; Fax: +49 89 289-17307; E-mail: [email protected].
منابع مشابه
Validating Top-level and Domain Ontology Alignments using WordNet
Matching domain and top-level ontologies is an important task but still an open problem in the ontology matching field, particularly due to their different levels of abstraction. Beyond that, validating candidate alignments is crucial before exploiting them within ontology construction and integration processes involving such kinds of ontologies. This paper concerns the automatic validation of ...
متن کاملFrom Software APIs to Web Service Ontologies: A Semi-automatic Extraction Method
Successful employment of semantic web services depends on the availability of high quality ontologies to describe the domains of these services. As always, building such ontologies is difficult and costly, thus hampering web service deployment. Our hypothesis is that since the functionality offered by a web service is reflected by the underlying software, domain ontologies could be built by ana...
متن کاملThe Research on Model Transformations, Based on Domain Metamodel, for Designing Requirements Specifications
ion [26]. 3 Roles of ontologies in software engineering Ontologies, for software design and development, can be used with the following objectives [29] [26]: • Specification: ontologies are used to specify either the requirements and components definitions (informal use) or the system ́s functionality. • Confidence: ontologies are used to check the system ́s design. • Reusability: ontologies coul...
متن کاملOntology-based Land Degradation Assessment from Satellite Images
In this paper, we introduce the idea of documenting operational chains for land degradation assessment using ontologies. We believe that this will help end-users in better understanding the land degradation characteristics and evaluate the results of the assessment process. Since the application domain is wide, various operational chains for land degradation assessment and their associated docu...
متن کاملDefinitions Management: A Semantics-Based Approach for Clinical Documentation in Healthcare Delivery
Structured Clinical Documentation is a fundamental component of the healthcare enterprise, linking both clinical (e.g., electronic health record, clinical decision support) and administrative functions (e.g., evaluation and management coding, billing). Documentation templates have proven to be an effective mechanism for implementing structured clinical documentation. The ability to create and m...
متن کاملLearning Web Service Ontologies: an Automatic Extraction Method and its Evaluation
Successful employment of semantic Web services depends on the availability of high quality ontologies to describe the domains of these services. Building such ontologies is difficult and costly, thus hampering Web service deployment. As a solution, we describe an automatic extraction method that learns domain ontologies from textual documentations attached to Web services. We evaluate our metho...
متن کامل